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RAW SEQUENCE LISTING DATE: 01/20/2004 

PATENT APPLICATION: US/09/357 , 675D TIME: 13:24:38 

Input Set : A:\TJU.ST25.txt 

Output Set: N:\CRF4\01202004\I357675D.raw 

3 <110> APPLICANT: Croce, Carlo 

5 <120> TITLE OF INVENTION: Nitrilase Homologs 

7 <130> FILE REFERENCE: TJU-2510 

9 <140> CURRENT APPLICATION NUMBER: 09/357,6750 
10 <141> CURRENT FILING DATE: 1999-07-20 I\ n E 

12 <150> PRIOR APPLICATION NUMBER: 60/093,350 F N 1 C 

13 <151> PRIOR FILING DATE: 1998-07-20 " 
15 <160> NUMBER OF SEQ ID NOS : 31 

17 <170> SOFTWARE: PatentIn version 3.2 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 1416 

21 <212> TYPE: DNA 

22 <213> ORGANISM: cDNA Sequence 

25 <220> FEATURE: 

26 <221> NAME/KEY: misc_feature 

27 <222> LOCATION: (19).. (19) 

28 <22 3> OTHER INFORMATION: n=a 
30 <400> SEQUENCE: 1 

W — > 31 gcccactcgc tgcggcctnt ctggctccag accgccctcc ggatcggacc ctgcgaatgg 



60 



33 ttttggctat atcttcatgt aggacctact ccctatcccg tcggccgcgg ctgggcttca 120 
35 tcaccaggcc tcctcacaga ttcctgtccc ttctgtgtcc tggactccgg atacctcaac 
37 tctcagtact ttgtgctcag cccaggccca gagccatggc tatctcctct tcctcctgcg 
39 aactgcccct ggtggctgtg tgccaggtaa catcgacgcc agacaagcaa cagaacttta 
41 aaacatgtgc tgagctggtt cgagaggctg ccagactggg tgcctgcctg gctttcctgc 
43 ctgaggcatt tgacttcatt gcacgggacc ctgcagagac gctacacctg tctgaaccac 
45 tgggtgggaa acttttggaa gaatacaccc agcttgccag ggaatgtgga ctctggctgt 
47 ccttgggtgg tttccatgag cgtggccaag actgggagca gactcagaaa atctacaatt 
4 9 gtcacgtgct gctgaacagc aaaggggcag tagtggccac ttacaggaag acacatctgt 
51 gtgacgtaga gattccaggg caggggccta tgtgtgaaag caactctacc atgcctgggc 660 
53 ccaqtcttga gtcacctgtc agcacaccag caggcaagat tggtctagct gtctgctatg 720 
^ ... 780 

840 
900 
960 
1020 
1080 



180 
240 
300 
360 
420 
480 
540 
600 



55 acatgcggtt ccctgaactc tctctggcat tggctcaagc tggagcagag atacttacct 

57 atccttcagc ttttggatcc attacaggcc cagcccactg ggaggtgttg ctgcgggccc 

59 gtgctatcga aacccagtgc tatgtagtgg cagcagcaca gtgtggacgc caccatgaga 

61 agagagcaag ttatggccac agcatggtgg tagacccctg gggaacagtg gtggcccgct 

63 gctctgaggg gccaggcctc tgccttgccc gaatagacct caactatctg cgacagttgc 

65 gccgacacct gcctgtgttc cagcaccgca ggcctgacct ctatggcaat ctgggtcacc 

67 cactgtctta agacttgact tctgtgagtt tagacctgcc cctcccaccc ccaccctgcc 114 0 

69 actatgagct agtgctcatg tgacttggag gcaggatcca ggcacagctc ccctcacttg 1200 

71 gagaaccttg actctcttga tggaacacag atgggctgct tgggaaagaa actttcacct 1260 

73 gagcttcacc tgaggtcaga ctgcagtttc agaaaggtgg aattttatat agtcattgtt 1320 

75 tatttcatgg aaactgaagt tctgctgagg gctgagcagc actggcattg aaaaatataa 1380 

77 taatcataaa gtcaaaaaaa aaaaaaaaaa aaaaaa - 1^16 

80 <210> SEQ ID NO: 2 
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RAW SEQUENCE LISTING DATE: 01/20/2004 

PATENT APPLICATION: US/09/357 , 675D TIME: 13:24:38 
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81 <211> LENGTH: 23 

82 <212> TYPE: DNA 

83 <213> ORGANISM: Homo sapiens 

85 <4 00> SEQUENCE: 2 

86 tctgaaactg cagtctgacc tea 

89 <210> SEQ ID NO: 3 

90 <211> LENGTH: 21 

91 <212> TYPE: DNA 

92 <213> ORGANISM: Homo sapiens 

94 <4 00> SEQUENCE: 3 

95 caggcacagc tcccctcact t 

98 <210> SEQ ID NO: 4 

99 <211> LENGTH: 20 

100 <212> TYPE: DNA 

101 <213> ORGANISM: Homo sapiens 

104 <220> FEATURE: 

105 <221> NAME/KEY: misc_feature 

106 <222> LOCATION: (3) . . (3) 

107 <223> OTHER INFORMATION: n is a, c, g, or t 

109 <220> FEATURE: 

110 <221> NAME/KEY: misc_feature 

111 <222> LOCATION: (6).. (6) 

112 <223> OTHER INFORMATION: n is a, c, g, or t 

114 <220> FEATURE: 

115 <221> NAME/KEY: misc_feature 

116 <222> LOCATION: (9).. (9) 

117 <223> OTHER INFORMATION: n is a, c, g, or t 

119 <220> FEATURE: 

120 <221> NAME/KEY: misc_feature 

121 <222> LOCATION: (12).. (12) 

122 <223> OTHER INFORMATION: n is a, c, g, or t 

124 <220> FEATURE: 

125 <221> NAME/KEY: misc_feature 

126 <222> LOCATION: (18).. (18) 

■ 127 <223>^OTHER INFORMATION: n is a>c, 9/ or t 
129 <400> SEQUENCE: 4 
W — > 130 gtngtnccng gncaygtngt 

133 <210> SEQ ID NO: 5 

134 <211> LENGTH: 26 

135 <212> TYPE: DNA 

136 <213> ORGANISM: Homo sapiens 
139 <220> FEATURE: 

14 0 <221> NAME/KEY: misc_feature 

141 <222> LOCATION: (6).. (6) 

142 <223> OTHER INFORMATION: n is a, c, g, or t 

144 <220> FEATURE: . ' 

145 <221> NAME/KEY: misc_feature 
14 6 <222> LOCATION: (12).. (12) 

14 7 <223> OTHER INFORMATION: y=c or t 
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RAW SEQUENCE LISTING DATE: 01/20/2004 

PATENT APPLICATION: US/09/357 , 675D TIME: 13:24:38 

- Input Set : A:\TJU.ST25.txt 
Output Set: N:\CRF4\01202004\l357675D.raw 

149 <220> FEATURE: 

150 <221> NAME/KEY: misc_feature 

151 <222> LOCATION: (15).. (15) 

152 <223> OTHER INFORMATION: n is a, c, g, or t 

154 <220> FEATURE: 

155 <221> NAME/KEY: misc_f eature 

156 <222> LOCATION: (18).. (18) 

157 <223> OTHER INFORMATION: n is a, c, g, or t 

159 <220> FEATURE: 

160 <221> NAME/KEY: misc_f eature 

161 <222> LOCATION: (21) . . (21) 

162 <223> OTHER INFORMATION: y= c or t 

164 <220> FEATURE: 

165 <221> NAME/KEY: misc_f eature 

166 <222> LOCATION: (24).. (24) 

167 <223> OTHER INFORMATION: n = a,c,g, or t 
169 <400> SEQUENCE: 5 

W — > 170 acrtgnacrt gyttnacngt ytgngc 

173 <210> SEQ ID NO: 6 

174 <211> LENGTH: 21 

175 <212> TYPE: DNA 

17 6 <213> ORGANISM: Drosophila melanogaster 

178 <400> SEQUENCE: 6 

17 9 gcgcctttgt ggcctcgact g 

182 <210> SEQ ID NO: 7 

183 <211> LENGTH: 21 

184 <212> TYPE: DNA 

185 <213> ORGANISM: Drosophila melanogaster 

187 <400> SEQUENCE: 7 

188 cggtggcgga agttgtctgg t 

191 <210> SEQ ID NO: 8 

192 <211> LENGTH: 20 

193 <212> TYPE: DNA 

194 <213> ORGANISM: Caenorhabditis elegans 
196^400>==SEQUENCEr 8 " " = ^ - - - ^. 
197 gtggcggctg ctcaaactgg 

200 <210> SEQ ID NO: 9 

201 <211> LENGTH: 21 

202 <212> TYPE: DNA 

203 <213> ORGANISM: Caenorhabditis elegans 

205 <400> SEQUENCE: 9 

206 tcgcgacgat gaacaagtcg g 

209 <210> SEQ ID NO: 10 

210 <211> LENGTH: 19 

211 <212> TYPE: DNA 

212 <213> ORGANISM: Homo sapiens 

214 <400> SEQUENCE: 10 

215 gccctccgga tcggaccct 
218 <210> SEQ ID NO: 11 
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RAW SEQUENCE LISTING DATE: 01/20/2004 

PATENT APPLICATION: US/09/357 , 675D TIME: 13:24:38 

Input Set : A:\TJU.ST25.txt 

Output Set: N:\CRF4\01202004\I357675D.raw 

219 <211> LENGTH: 20 

220 <212> TYPE: DNA 

221 <213> ORGANISM: Homo sapiens 

223 <400> SEQUENCE: 11 

224 gacctactcc ctatcccgtc 

227 <210> SEQ ID NO: 12 

228 <211> LENGTH: 21 

229 <212> TYPE: DNA 

230 <213> ORGANISM: Homo sapiens 

232 <400> SEQUENCE: 12 

233 gctgcgaagt gcacagctaa g 

236 <210> SEQ ID NO: 13 

237 <211> LENGTH: 24 

238 <212> TYPE: DNA 

239 <213> ORGANISM: Homo sapiens 

241 <400> SEQUENCE: 13 

242 aaactgaagc ctctttcctc tgac 

245 <210> SEQ ID NO: 14 

246 <211> LENGTH: 20 

247 <212> TYPE: DNA 

24 8 <213> ORGANISM: Homo sapiens 

250 <400> SEQUENCE: 14 

251 tgggcttcat caccaggcct 

254 <210> SEQ ID NO: 15 

255 <211> LENGTH: 22 

256 <212> TYPE: DNA 

257 <213> ORGANISM: Homo sapiens 

259 <400> SEQUENCE: 15 

260 ctgggctgag cacaaagtac tg 

263 <210> SEQ ID NO: 16 

264 <211> LENGTH: 21 

265 <212> TYPE: DNA 

266 <213> ORGANISM: Homo sapiens 

268 <400> SEQUENCE: 16 

269 gcttgtctgg cgtcgatgtt a 

272 <210> SEQ ID NO: 17 

273 <211> LENGTH: 36 

274 <212> TYPE: DNA 

275 <213> ORGANISM: Homo sapiens 

277 <400> SEQUENCE: 17 

278 tgacgtcgac atatgtcaac tctagttaat accacg 

281 <210> SEQ ID NO: 18 

282 <211> LENGTH: 25 

283 <212> TYPE: DNA 

284 <213> ORGANISM: Homo sapiens 

286 <400> SEQUENCE: 18 

287 tgggtacctc gactagctta tgtcc 

290 <210> SEQ ID NO: 19 

291 <211> LENGTH: 147 
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RAW SEQUENCE LISTING DATE: 01/20/2004 

PATENT APPLICATION: US/09/357 , 675D TIME: 13:24:38 

Input Set : A:\TJU.ST25.txt 

Output Set: N: \CRF4\01202004\I357675D, raw 

292 <212> TYPE: PRT 

293 <213> ORGANISM: Homo sapiens 

296 <220> FEATURE: 

297 <221> NAME/KEY: misc_f eature 

298 <222> LOCATION: (82).. (82) 

299 <223> OTHER INFORMATION: Xaa is an unknown amino acid 

301 <400> SEQUENCE: 19 ^ „ . „ i 

303 Met Ser Phe Arg Phe Gly Gin" His Leu lie Lys Pro Ser Val Val Phe 

307 Leu Lys Thr Glu Leu Ser Phe Ala Leu Val Asn Arg Lys Pro Val Val 

308 20 25 30 

311 Pro Gly His Val Leu Val Cys Pro Leu Arg Pro Val Glu Arg Phe His 

3^5 Asp Leu llg Pro Asp Glu Val Ala Asp Leu Phe Gin Thr Thr Gin Arg 
316 50 55 60 

319 Val Gly Thr Val Val Glu Lys His Phe His Gly Thr Ser Leu Thr Phe 

320 65 70 75 »U 
W"> 323 ser Xaa Gin Asp Gly Pro Glu Ala Gly Gin Thr Val Lys His Val His 

324 85 90 

327 val His Val Leu Pro Arg Lys Ala Gly Asp Phe His Arg Asn Asp Ser 

328 100 105 110 

331 lie Tyr Glu Glu Leu Gin Lys His Asp Lys Glu Asp Phe Pro Ala Ser 

332 115 120 125 

335 Trp Arg Ser Glu Glu Glu Glu Ala Ala Glu Ala Ala Ala Leu Arg Val 

336 130 135 140 

339 Tyr Phe Gin 

340 145 

343 <210> SEQ ID NO: 20 

344 <211> LENGTH: 150 

345 <212> TYPE: PRT 

34 6 <213> ORGANISM: murine 

348 <400> SEQUENCE: 20 ^ o .7 i pha 

350 Met ser Phe Arg Phe Gly Gin His Leu He Lys Pro Ser Val Val Phe 



351 1 5""" " 10 1^ 

354 Leu "Lys Thr Glu Leu Ser Phe Ala Leu Val Asn Arg Lys Pro Val Val 

355 20 25 30 

358 Pro Gly His Val Leu Val Cys Pro Leu Arg Pro Val Glu Arg Phe Arg 

35 40 45 

362 ASP Leu His Pro Asp Glu Val Ala Asp Leu Phe Gin Val Thr Gin Arg 
^63 50 55 60 

366 Val Gly Thr Val Val Glu Lys His Phe Gin Gly Thr Ser He Thr Phe 

367 65 ''O "^5 «^ 

370 Ser Met Gin Asp Gly Pro Glu Ala Gly Gin Thr Val Lys His Val His 

371 85 90 y5 

374 Val His Val Leu Pro Arg Lys Ala Gly Asp Phe Pro Arg Asn Asp Asn 

375 100 105 110 

378 lie Tyr Asp Glu Leu Gin Lys His Asp Arg Glu Glu Glu Asp Ser Pro 

379 115 120 125 

382 Ala Phe Trp Arg Ser Glu Lys Glu Met Ala Ala Glu Ala Glu Ala Leu 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 01/20/2004 

PATENT APPLICATION: US/09/357 , 675D TIME: 13:24:39 

Input Set : A:\TJU.ST25.txt 

Output Set: N:\CRF4\01202004\l357675D.raw 

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please revxew the 
Sequence Listing to ensure that a corresponding explanation is presented in the < 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:l; N Pes. 19 
Seq#:4; N Pos, 3,6,9,12,18 
Seq#:5; N Pos. 6,15,18,24 
Seq#:19; Xaa Pos. 82 
Seq#:25; Xaa Pos. 6 
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VERIFICATION SUMMARY DATE: 01/20/2004 

PATENT APPLICATION: US/09/357 , 675D TIME: 13:24:39 



Input Set : A:\TJU.ST25.txt 

Output Set: N:\CRF4\01202004\l357675D.raw 

L:31 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos . : 0 
l'-130 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:0 
l'i70 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 after pos . : 0 
l'323 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:19 after pos.:80 
1-827 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:25 after pos,:0 
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